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(54) Hands-free telephone 



(57) The present invention provides a hands-free 
telephone wherein near-end caller voice detection infor- 
mation is provided with little error, and wherein electric 
power saving effects are improved, and also wherein 
the far-end caller does not receive an unnatural impres- 
sion. An echo canceller comprises: level calculating 
means 1 , 2. and 3 for calculating the level of reception 
signals, input transmission signals, and residual sig- 
nals; and adaptive filter 4 for inputting the reception sig- 
nals and transmission input signals and outputting 
residual signals; echo cancellation amount calculation 
means 5 for calculating the echo cancellation amount 
from the input transmission signal level and residual sig- 
nal level; voice detection threshold value control means 
6 for controlling the voice detection threshold value 
according to the reception signal level and the echo 
cancellation amount; voice detection means 7 for out- 
putting a voice detection flag using the residual signal 
level and the voice detection threshold value; attenuator 
control means 9 for controlling the suppression amount 
based on the reception signal level, and attenuator 10 
for performing suppression according to the suppres- 
sion amount. 
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Description 

BACKGROUND OF THE INVENTION 

Field of the Invention 

The present invention relates to a hands-free tele- 
phone which has an echo caricelier and an echo sup- 
presser for canceling or suppressing echo signals 
regarding telephones with loudspeakers, and particu- 
larly to voice detection and voice transmission control. 

Description of the Related Art 

Fig. 14 and Fig. 15 illustrate the overall configura- 
tion of a known hands-free telephone device in which a 
loudspeaker, a microphone for hands-free use. and an 
echo processing unit are connected to a cellular tele- 
phone to be used as a loudspeaker telephone. Fig. 14 
illustrates an arrangement wherein the cellular tele- 
phone is an analog telephone, and Fig. 1 5. a digital tel- 
ephone. 

In the following description, "echo processing unit" 
indicates an echo canceller or echo suppresser. ''Near- 
end caller" indicates the user of the loudspeaker tele- 
phone, and "far-end caller" indicates the party other 
than the near-end caller 

Rrst description will be made regarding the recep- 
tion system. With reference to the analog cellular tele- 
phone illustrated in Fig. 14. signals received by an 
antenna 29 are demodulated by a wireless unit 28 and 
become analog voice signals, and further are A/D con- 
verted into far-end caller digital signals by an A/D con- 
verter 27. thus becoming reception signals R(k). Here, k 
denotes the time for R(k). S(k), and S2(k). which are 
digital signals. 

Also, with reference to the digital cellular telephone 
illustrated in Fig. 15, signals received by the antenna 29 
are demodulated by the wireless unit 28 and become 
voice encoded data, and further are decoded by a voice 
•signal decoding unit 32, thus becoming digital reception 
signals R(k) which are voice signals. The reception sig- 
nals R(k) carry the voice of the far-end caller. 

. in the case of either, i.e., of the analog cellular tele- 
phone illustrated in Fig. 14 and the digital cellular tele- 
phone illustrated in Fig. 15, the voice signals pass 
through the echo processing urrit 25. and the analog 
voice signals which have been D/A converted by a D/A 
converter 24 are cast into the air by means of a loud- 
speaker 22. thus reaching the ear of the near-end caller. 

Next, description will be made regarding the trans- 
mitting system. In the case of either, i.e.. of the analog 
cellular telephone illustrated in Fig. 14 and the digital 
cellular telephone illustrated in Fig. 15. the signals input 
from the hands-free microphone 21 are A/D converted 
into digital signals by an A/D converter 23. thus becom- 
ing input transmission signals S(k). 

In the event that only the near-end caller is speak- 



ing, the input transmission signal S(k) is the voice of the 
near-end caller, and in the event that only the far-end 
caller is speaking, the input transmission signal S(k) is 
the reception signal R(k) which has been D/A con- 

5 verted, passed through the loudspeaker 22. and re- 
circuited through the hands-free microphone 21 as echo 
signals. Also, in the event that t>oth callers are speaking 
at the same time, this is 
voice of the near-end caller and the echo signal. 

10 Regarding loudspeaker telephones wherein the 
use does not use a hand set but rather uses a loud- 
speaker 22 and a hands-free microphone 21. the echo 
processing unit 25 functions to preverrt echo which con- 
sists of the reception signal R(k) being output from the 

15 loudspeaker 22 and passing around into the hands-free 
microphone 21 . i.e., the echo processing unit 25 acts to 
cause the echo component within the input transmis- 
sion signal S(k) so as to output only the voice of the 
near-end caller as the output transmission signal S2(l^. 

20 The cellular phone proper 30 is provided with voice 
detection means 33 for inhibiting transmission of air^ 
waves during periods in which the near-end caller is not 
speaking, as means for saving battery power, and in the 
case of the analog cellular telephone illustrated in Rg. 

25 14. the voice detection means 33 output a voice detec- 
tion flag FLG to the wireless unit 28. Accordingly, the 
wireless unit 28 performs voice transmission control 
wherein ainwaves are transmitted only at times that the 
voice detection flag FLG indicates presence of sound. 

30 Also, in the case of the digital cellular telephone 
illustrated in Fig. 15, the voice detection flag FLG is also 
output to the voice encoding unit 31 and noise canceller 
8 located between the wireless unit 28 and the echo 
processing unit 25, and noise cancellation processing 

35 and voice encoding processing is performed only at 
times that the voice detection flag FLG indicates pres- 
ence of sound, so that electric power consumption is 
saved not only at the wireless unit 28 but also at the 
voice encoding unit 31 and noise cancellers. 

40 Further, regarding examples of known echo cancel- 
lers, an example is described in "Hands-free conversa- 
tion with echo canceller" (OW Denki Kenkyu Kaihatsu, 
January 1989, #141. Vol. 56, No 1. pp 34-40). 

The problems facing echo cancellers will now be 

45 described. The known echo processing unit 25 pos- 
sesses functions for eliminating or inhibiting echo com- 
ponents, but does not have the function for outputting 
the voice detection flag FLG for voice transmission con- 
trol by the wireless unit 28. Accordingly, in order to per- 

50 form voice transmission control, it becomes necessary 
to perform voice detection of the near-end caller from 
the output transmission signals output of the echo 
processing unit 25 at the voice detection means 33 of 
the cellular telephone proper 30. However, in the event 

55 that the echo processing unit 25 is an echo canceller 
using an adaptive filter, the adaptivity of the filter is 
insufficient in cases such as described below. 

Such situations would be cases where conversation 
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using the loudspeaker telephone has just been initiated 
and the adaptivity of the adaptive filter is insufficient, or 
where there is motion such as the near-end caller mov- 
ing his/her body, causing change in the reflection state 
of the wavetbrnis from the loudspeaker to the hands- 
free microphone, consequently causing change in the 
echo path which the adaptive filter has set as the adap- 
tive object thereof. 

In such cases, an echo component remains in the 
output transmission signal, so in the event that voice 
detection of the near-end caller is performed based on 
the output transmission signal, the residual echo within 
the output transmission signal is mis-identified as the 
voice of the near-end caller. 

This problem is described with reference to Fig. 
16A and Rg. 16B. In these Figures, the state of the 
parameters within the voice detection means 33 
wherein the near-end caller is not speaking and only the 
far-end caller Is speaking are illustrated. Here, the 
method of detection of the near-end caller voice in the 
voice detection means 33 is as described below. 

The level of a certain past section wherein the 
measured output transmission signal level is low and it 
can be Inferred that the near-end caller is not speaking 
is averaged, and this value is used as the near-end 
•background noise level. Adding a margin to this back- 
ground noise level yields the voice detection threshold 
value Tl-I(k) for the near-erd caller voice. 

In this case, the near-end caller is not speaking, so 
the greater part of the input transmission signal S(k) is 
echo conrponent. In Fig. 16A, the echo component of 
the input transmission signal S(k) is canceled to a cer- 
tain degree by the echo canceller, and the level Ls2(k) 
of the output transmission signal S2(k). which is the 
residual signal following echo cancellation, is smaller 
than the input transmission signal level Ls(k) over all 
sections within Rg. 16A. 

The output transmission signal level Ls2(k) is 
smaller than the voice detection threshold value TH(k) 
until the residual echo within the output transmission 
signal S2(k) immediately following initiating of speaking 
of the far-end caller increases to a certain extent, but 
when the residual echo within the output transmission 
signal increases to the extent that the output transmis- 
sion signal level Ls2(k) exceeds the noise voice detec- 
tion threshold value TH(k), the voice detection means 
33 mis-identifies the residual echo as the voice of the 
near-end caller, and changes the voice detection flag 
FLG from 0 to 1 as shown in Fig. 16B. Consequently, 
not only does the transmission of sound by the wireless 
unit 28 consume unnecessary electric power, but the 
far-end caller hears an echo of his/her own voice. 

Next, description will be made regarding the prob- 
lems facing the echo suppresses This description will 
be made in regard to the problems which occur in the 
case that the echo processing unit 25 is an echo sup- 
presser which uses an attenuator. 

This echo suppresser is arranged such that in the 



event that it is indicated by the reception signal level that 
the far-end caller is speaking, a signal wherein the input 
transmission signal of the near-end caller is suppressed 
is output as an output transmission signal, and on the 

5 other hand, in the event that it is indicated by the recep- 
tion signal level that the far-end caller is no longer 
speaking, a signal wherein the input transmission signal 
of the near-end caller is not suppressed is output as the 
output transmission signal. 

10 This problem regarding the echo suppresser is 
described with reference to Fig. 17A and Fig. 17B. In 
these Figures, the state of the parameters within the 
voice detection means 33 wherein the near-end caller is 
not speaking and only the far-end caller is speaking. 

75 and also wherein the background noise is relatively 
great, are illustrated. 

First, the far-end caller initiates speaking, and when 
the input transmission signal level Ls(k) exceeds the 
voice detection threshold TH(I^ as shown in Fig. 1 7A. a 

20 signal wherein the input transmission signal S(k) has 
been suppressed is output from the echo suppresser as 
the output transmission signal S2(k), and the output 
transmission signal level Ls2(k) rapidly decreases to a 
level lower than the near-end background noise level. 

25 Also, the voice detection threshold TH(k) follows the 
output transmission signal level Ls2(k) in a somewhat 
delayed manner, and is reduced in accordance of the 
reduction in the output transmission signal level Ls2(k) 
with the reduction in the level of the background rv)ise. 

30 When the far-end caller finishes speaking, the echo 
suppresser disengages the suppression on the output 
transmission signal S2(k), so that the output transmis- 
sion signal level Ls2(k) suddenly increases and 
exceeds the voice detection threshokJ TH(k). The voice 

35 detection means 33 mis-identifies this as the voice of 
the near-end caller, and changes the voice detection 
flag FLG from 0 to 1. as shown in Fig. 17B. Conse- 
quently not only does the transmission of sound by the 
wireless unit 28 consume unnecessary electric power, 

40 but the far-end caller hears unnatural tjackground noise 
in response to ending of his/her own speaking. 

Depending on the configuration of the echo cancel- 
ler, there are arrangements wherein an attenuator simi- 
lar to the echo suppresser is provided along with the 

45 adaptive filter, but the same problems as with the echo 
suppresser described above occur with the echo can- 
celler, as well. 

Also, in cases where the echo processing unit 25 is 
an echo suppresser using an attenuator, and in cases 

so where an echo canceller is provided with an attenuator, 
the following problems occur in the noise canceller. 

Generally, noise cancellers perform estimation of 
background noise properties at times when the input 
signal to the noise canceller is comprised solely of back- 

55 ground noise, and control the amount of cancellation of 
noise based on the earlier-estimated background noise 
properties, according to whether the input signals are 
voice signals or not. What is important here is that in the 
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case that background noise properties estirhattoil is 
performed in a voice section, the noise cancellation 
results in deterioration of voice quality. 

However, in cases where the echo processing unit 
25 is an echo suppresser using an attenuator, and in s 
cases where an echo canceller is provided with an 
attenuator, Jhe ya^ ^etecti^on jpe^^^ 
raises a voice detection flag FLQ. so tiiat the noise can- 
celler 34 is not able to perform appropriate noise can- 
cellation processing but rather deteriorates the voice io 
quality, giving the far-end caller an unnatural impres- 
sion. 

SUMMARY OF THE INVENTION 

The present invention has been made in light of the 
above-described problenis, and accordingly, it is an 
object of the present invention to provide a hands-free 
telephone wherein, in the event that the functions of 
echo canceller, echo suppresser and voice transmis- 20 
sion control are combined and used, near-end caller 
voice detection information is provided with little error, 
and wherein electric power saving effects are improved, 
and also wherein the far-end caller does not receive an 
unnatural impression. 25 

In order to achieve the above objects, according to 
an aspect of the present invention, a hands-free tele- 
phone, comprises: a hands-free microphone which con- 
verts the voice of a near-end caller into electrical signals 
and outputs as input transmission signals; a loud- 30 
speaker which is provided near the atx>ve microphone 
and outputs reception signals which are the voice of the 
^r-end caller into the air as audible sound waves; an 
echo processing unit which outputs as an output trans- 
mission signal a residual signal comprised of the input 35 
transrhission signal of the near-end caller input through 
the aforementioned microphone, the echo component 
thereof having been removed, and also which outputs 
the reception signals from the tar-end caller to the afore- 
mentioned loudspeaker, and further which outputs a 4o 
voice detection flag based on comparison between a 
voice detection threshold value which changes accord- 
ing to change in level of the aforementioned reception 
signal and the level of the aforementioned residual sig- 
nal; and transmission control means for performing 4S 
transmission control of the aforementioned output trans- 
mission signals only when the aforementioned voice 
detection flag indicates presence of voice, the afore- 
mentioned transmission control being performed by 
receiving power supply only when the aforementioned so 
voice detection flag indicates presence of voice. 

Also, according to another aspect of the present 
invention, the aforementioned echo processing unit 
comprises an echo canceller, conprising: an adaptive 
filter for taking input of the aforementioned reception f il- ss 
ter and the aforementioned input transmission signal 
and outputting a residual signal comprised of the afore- 
mentioned input transmission signal with the echo com- 



ponent thereof removed; first level calculating means tor 
calculating the level of the aforementioned residual sig- 
nal; voice detection threshold value control means for 
controlling the aforementioned voice detection thresh- 
old value; and voice detection means tor outputting a 
voice detection flag which indicates the presence of 
voice when the level of the aforementioned residual sig- 
nal iexceeds the voice detectidh threshold value outpirt 
from the aforementioned voice detectton threshold 
value control means. 

Further, according to yet another aspect of the 
present invention, the aforementioned echo canceller 
further comprises second level calculating means for 
calculating the level of the aforementioned reception 
signal, and also the aforementioned voice detection 
threshold value control means controls the aforemen- 
tioned voice detection threshold value according to the 
change in level of the reception signal output from the 
aforementioned second level calculating means. 

Further yet, according to another aspect of the 
present invention, the aforementioned echo canceller 
further comprises third level calculating means for cal- 
culating the level of the aforementioned input transmis- 
sion signal and echo cancellation amount calculating 
means for calculating the amount of echo cancellation 
based on the level of the aforementioned input trans- 
mission signal arKi the aforementioned residual signal, 
and also the aforementioned voice detection threshold 
value control means controls the aforementioned voice 
detection threshold value according to the change in 
level of echo cancellation amount output from the afore- 
mentioned echo cancellation amount calculating 
means. 

Furthermore, according to yet another aspect of the 
present invention, the aforementioned echo canceller 
further comprises second level calculating means for 
calculating the level of the aforementioned reception 
signal, and also the aforementioned voice detection 
threshold value control means controls the aforemen- 
tioned voice detection threshold value according to a 
value of the reception signal level from which the echo 
cancellation amount has been subtracted. 

Moreover, according to an aspect of the present 
invention, the aforementioned echo processing unit 
comprises an echo suppresser, comprising: second 
level calculating means for calculating the level of the 
aforementioned reception signal; voice detection 
threshold value control means for controlling the afore- 
mentioned voice detection threshold value according to 
change in the level of the aforementioned reception sig- 
nal ; third level calculating means for calculating the level 
of the aforementioned input transmission signal; voice 
detection means for outputting a voice detection flag 
which indicates the presence of voice when the level of 
the aforementioned input transmission signal exceeds 
the voice detection threshold value output from the 
aforementioned voice detection threshold value control 
means; an attenuator for suppressing the aforemen- 
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tloned input transmissions signal according to a certain 
suppression amount and outputting suppression trans- 
mission signals; and attenuator control means for con- 
trolling the suppression amount set for the 
aforementioned attenuator based on the aforemen- 5 
tioned reception signal level. 

Moreover yet, according to another aspect of the 
present invention, the aforementioned echo canceller 
further comprises an attenuator for suppressing the 
aforementioned residual signal according to a certain 10 
suppression amount and outputting suppression trans- 
mission signals, and attenuator control means for con- 
trolling the suppression amount set for the 
aforementioned attenuator based on the aforemen- 
tioned reception signal level. ^5 

Further yet. according to another aspect of the 
present invention, the aforementioned echo canceller 
further conprises a noise canceller provided between 
the aforementioned adaptive filter and the aforemen- 
tioned attenuator, the noise canceller performing esti- 20 
mation of background noise properties at times when 
the voice detection flag output from the aforementioned 
voice detection means indicates that the near-end caller 
is not speaking, controlling the noise cancellation 
amount according to the aforementioned voice detec- 25 
tion flag and outputting to the aforementioned attenua- 
tor a noise cancellation transmission signal comprised 
of the aforementioned residual signal from which noise 
has been removed, and also wherein the attenuator 
suppresses the aforementioned noise cancellation 30 
transmission signal according to a suppression amount 
set by the aforementioned attenuator control means and 
outputs suppression transmission signals. 

BRIEF DESCRIPTION OF THE DRAWINGS 35 

Rg. 1 is an overall constructional diagram of the 
hands-free telephone relating to the present inven- 
tion, illustrating the case in which a loudspeaker tel- 
ephone using an analog cellular telephone is 40 
enrployed; 

Fig. 2 Is an overall constructional diagram of the 
hands-free telephone relating to the present inven- 
tion, illustrating the case in which a loudspeaker tel- 
ephone using a digital cellular telephone is 45 
employed: 

Fig. 3 is a block diagram illustrating the construction 
of the echo canceller within the echo processing 
unit 25A in a first embodiment according to the 
present invention; so 
Fig. 4A through Rg. 4C are explanatory diagrams 
of the signal levels for describing the operation of 
the echo canceller in the first emfc>odimerrt accord- 
ing to the present invention; 

Fig. 5 is a block diagram illustrating the construction ss 
of the echo cancelled with the echo processing unit 
25A in a second embodiment according to the 
present invention; 



Rg. 6A and Fig. 6B are explanatory diagrams of the 
signal levels for describing the operation of the 
echo canceller in the second ennbodiment accord- 
ing to the present invention; 
Fig. 7 is a block diagram illustrating the construction 
of the echo canceller within the echo processing 
unit 25A in a third embodiment according to the 
present invention; 

Rg. 8A through Fig. 8C are explanatory diagrams 
of the signal levels for describing the operation of 
the echo canceller in the third embodiment accord- 
ing to the present invention; 
Rg. 9 is a block diagram illustrating the construction 
of the echo suppresser within the echo processing 
unit 25A in a fourth embodiment according to the 
present invention; 

Rg. IDA through Fig. 10C are explanatory dia- 
grams of the signal levels for describing the opera- 
tion of the echo suppresser in the fourth 
embodiment according to the present invention; 
Fig. 1 1 is a block diagram illustrating the construc- 
tion of the echo canceller within the echo process- 
ing unit 25A in a fifth embodiment according to the 
present invention; 

Fig. 12A through Fig. 12C are explanatory dia- 
grams of the signal levels for describing the opera- 
tion of the echo canceller in the fifth embodiment 
according to the present invention; 
Rg. 13 is a block diagram illustrating the construc- 
tion of the echo canceller within the echo process- 
ing unit 25A in a sixth embodiment according to the 
present invention; 

Rg. 14 is a constructional diagram illustrating a pri- 
vately known but unpublished hands-free tele- 
phone, showing the overall construction of a 
loudspeaker telephone employing an analog cellu- 
lar telephone; 

Rg. 15 is a constructional diagram illustrating a pri- 
vately known but unpublished hands-free tele- 
phone, showing the overall construction of a 
loudspeaker telephone employing a digital cellular 
telephone; 

Rg. 16A and Fig, 168 are explanatory diagrams of 
the signal levels for describing the operation of the 
echo canceller within a known echo processing 
unit, and the voice detection means of the cellular 
telephone proper; 

Rg. 17A and Fig. 17B are explanatory diagrams of 
the signal levels for describing the operation of the 
echo suppresser within a known echo processing 
unit, arKl the voice detection means of the cellular 
telephone proper. 
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DESCRIPTION OF THE PREFERRED EMBODI- 
MENTS 

First Embodiment " * ' 

Fig. 1 and Fig. 2 illustrate the overall construction ot 
a hands-free telephone according to a first embodiment 
of the present invention, illustrating the case in whidh a 
loudspeaker, hands-free micrcphone, and echo 
processing unit are connected to a cellular telephone to 
be employed as a loudspeaker telephone. Fig. 1 illus- 
trates the case of an analog cellular telephone, and Fig. 
2. that of a digital cellular telephone. In Fig. 1 and Fig. 2. 
the portions identical to those of the known example 
shown in Fig. 14 and Fig. 15 are denoted by the same 
reference numerals, and description thereof is omitted. 

Also, the hands-free telephone according to the first 
embodiment illustrated in Fig. 1 and Rg. 2 differs from 
the known example shown in Fig. 14 and Fig. 15 in the 
following aspects: Voice detection means 33 are not 
provided within the cellular telephone proper 30, and an 
echo processing unit 25A relating to the first embodi- 
ment has built-in voice detecting means which outputs a 
voice detection flag FLG, based on comparison 
between the voice detection threshold value which 
changes according to change in the reception signal 
level and the residual signal level. Also, the wireless unit 
28A functions as transmission control means for per- 
forming transmission control of the aforementioned out- 
put transmission signal S2(k) by receiving power supply 
only when the aforementioned voice detection flag FLG 
indicates presence of voice. 

Fig. 1 and Fig. 2 show an example where the wire- 
less unit 28A is provided with the functions as transmis- 
sion control means, but a non-wireless unit which per 
forms wire processing can be applied to the present 
invention as well, besides the wireless arrangement. 

Now, an example wherein an echo canceller is pro- 
vided to the aforementioned echo processing unit 25A 
to prevent residual echo within the residual signal which 
is the output transmission signal being mis-identified as 
the voice of the near-end caller, will be described with 
reference to Fig. 3. 

The echo canceller shown in Rg. 3 is comprised of 
an adaptive fitter 4 for taking input of the reception sig- 
nal R(k) and the input transmission signal S(k) and out- 
putting a residual signal U(k) comprised of the 
aforementioned input transmission signal S(k) with the 
echo component thereof removed, level calculating 
means 2 for calculating the level of the aforementioned 
residual signal U(k), voice detection threshold value 
control means 6 for controlling the voice detection 
threshold value TH(k), voice detection means 7 for out- 
putting a voice detection flag FLG which indicates the 
presence of voice when the level Lu(k) of the aforemen- 
tioned residual signal U(k) exceeds the voice detection 
threshold value TH(k) output from the aforementioned 
voice detection threshold value control means 6, and 



level calculating means 1 for calculating the level of the 
reception signal R(k). Then, the aforementioned voice 
detection threshold value control means 6 controls the 
voice detection - threshold value TH{k) according to 

5 change in the level Lr(k) of the reception signal output 
from the aforementioned level calculating means 1 . 

i.e.. as shown in Fig. 3. the adaptive filter 4 uses 
the input tiansmissioh signal S(k) and reception signal 
R(k) to cancel the echo component within the Input 

10 transmission signal S(k) and output the residual signal 
U(k). Now, the residual signal U(k) is equivalent to the 
output transmission signal S2(k) shown in Fig. 1 and 
Fig. 2. The level calculating means 1 and 2 use the 
rec^ion signal R(l^ and the residual signal U(k) to cal- 

75 culate and output the reception signal level Lr{k) and the 
residual signal level Lu(k). Here, the term "level" refers 
to e.g., a logarithm conversion value of power. 

The voice detection threshoid^value control means 
6 sets the voice detection threshold value TH(k) in 

20 accordance with the reception signal level Lr(k), and 
provides the voice detection means 33 with this value, 
i.e.. the voice detection threshold value control means 6 
sets the voice detection threshold value TH(k) as 
expressed by: 

25 

TH(k) = Lr(k) - p 1 

However, in order to provide a lower limit for TH(k). 
in the event that Lr(k) ^ p 1 + 0 1 . the setting is made 

30 such that TH(k) = 61. Here, p 1 and e 1 are both con- 
stants, with 6 1 being a constant for determining the 
lower limit of TH(k). 

As a result, the voice detection threshold value 
TH(k) is provided to the voice detection means 7 such 

35 that the voice detection threshold value TH(k) is great 
when the receptions signal level Lr(k) is great, and the 
voice detection threshold value TH(k) is small when the 
receptions signal level Lr(k) is small. In the event that 
the residual signal level Lu(k) exceeds the voice detec- 

40 tion threshold value TH(k). the voice detection flag FLG 
is set to 1 and output, and in the event that the residual 
signal level Lu(k) does not exceed the voice detection 
threshold value TH(k). the voice detection flag FLQ is 
set to 0 and output. 

45 The reason that the voice detection threshold value 
control means 6 sets the voice detection threshold value 
TH(k) according to the magnitude of the reception sig- 
nal level Lr(k) Is due to the following reason: i.e., in the 
event that the reception signal level Lr(k) is great, a 

so good deal of residual echo component remains within 
the residual signal U(k) even if a certairi amount of echo 
canceling has been performed to the input transmission 
signal S(k) by the adaptive filter 4. so that the residua! 
signal level Lu(k) indicates the level of a superimposed 

55 signal of the voice of the near-end caller and of the 
residual echo component. 

Accordingly, the voice detection means 33 mis- 
identifies the residual echo component within the resid- 
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ual signal U as the voice of the near-end caller, unless 
the voice detection threshold value TH(k) is set suffi- 
ciently great. 

On the other hand, in the event that the reception 
signal level Lr(k) is small, the amount of residual echo s 
component remaining within the residual signal U(k) 
due to echo canceling having been performed by the 
adaptive filter 4 is extremely small, so that the residual 
signal level Lu(k) accurately indicates the level of the 
voice of the near-end caller. 

By setting the voice detection threshold value TH(k) 
to a low level at this time, the voice detection means 33 
responds to the voice of the near-end caller within the 
residual signal U(l^ in a sensitive manner, thus allowing 
detection of minute consonant pronunciations at the is 
beginning of words. 

As can be understood from Fig. 4A through Fig. 4C 
describing the operation of the present first embodi- 
ment, the voice detection threshold value TH(k) also 
changes in accordance with the change in the reception 20 
signal level Lr(k), so that even in the event that the 
reception signal level Lr(k) is great, the residual signal 
level Lu(k) obtained based on the input transmission 
signal level Ls(k) and the reception signal level Lr(k) 
never exceeds the voice detection threshold value 25 
TH(k). Accordingly, the voice detection flag FLG 
remains set to 0. 

As described above, according to the present first 
embodiment, near-errorless detection of the voice of the 
near-end caller becomes possible regardless of the so 
reception signal level, and the wireless unit 28A per- 
forms transmission control of output transmission sig- 
nals such that power supply Is received only when the 
voice detection flag FLG indicates presence of voice. 
Thus, the electrical power consumption efficiency is 3s 
raised, and the far-end caller does not received an 
unnatural innpression. 

Also, with the echo canceller of the echo process- 
ing unit 25A relating to the present first embodiment, the 
voice detection flag FLG is output externally from the 40 
echo canceller, so in connparison with known loud- 
speaker telephones which use the output signal of the 
echo canceller as the voice detection means input, a 
loudspeaker telephone can be constructed wherein the 
effects of the reception signal level Lr(k) and the amount 45 
of echo canceling performed by the adaptive filter 4, do 
not easily cause mis-identifying of the voice of the near- 
end caller. 

Further, the voice detection threshold value control 
means 6 is such that a slight echo component remains so 
in the residual signal U(k) even after echo cancellation 
is performed to the input transmission signal S(k) by the 
adaptive f IHer 4. owing to the reception signal level Lr(k) 
being great. Thus, in the event that the residual signal 
level does not accurately represent the level of the near- ss 
^ end caller alone, the voice detection threshold value 
TH(k) is set so as to be great as to the residual signal. 
On the other hand, if the receptbn signal level Lr(k) is 



small, there is practically no echo component remaining 
in the residual signal U(k) even after echo cancellation 
is performed to the input transmission signal S(k) by the 
adaptive filter 4. Hence, In the event that the residual 
signal level accurately represents the level of the voice 
of the near-end caller alone, the voice detection thresh- 
old value TH(k} is set so as to be small as to the residual 
signal. Consequently, the voice detection means 7 is 
capable of performing detection of the voice of the near- 
end caller from the residual signal level, without being 
affected by the reception signal level. 

Second Embodiment 

Next. Fig. 5 is a constructional diagram illustrating 
the echo canceller provided to the echo processing unit 
25A of the second embodiment according to the present 
invention, illustrating an echo canceller which does not 
mis-identify the residual echo component within the 
residual signal with the voice of the near-end caller. 

In Fig. 5, the portions identical to those of the echo 
canceller according to the first embodiment shown in 
Fig. 3 are denoted by the same reference numerals, and 
description thereof Is omitted. New reference numerals 
are as follows: reference numeral 3 denotes a level cal- 
culating means of calculating the level of the input trans- 
mission signal S(k), reference numeral 5 denotes echo 
cancellation amount calculation means for calculating 
the echo cancellation amount D(k) from the aforemen- 
tioned input transmission signal level Ls{k) and residual 
signal level Lu(k) from the level calculating means 2, 
and the voice detection threshold value control means 6 
is arranged to control the voice detection threshold 
value according to the level change of the echo cancel- 
lation amount D(k) output from the aforementioned 
echo cancellation amount calculation means 5. 

i,e., in the present second embodiment, the level 
calculating means 3 calculates and outputs the input 
transmission signal level Ls(k) from the input transmis- 
sion signal S(k). The echo cancellation amount calcula- 
tion means 5 calculates the echo cancellation amount 
D(k) from the aforementioned input transmission signal 
level Ls(l^ and residual signal level Lu(k) according to 
the following expression. 

D(k) = Ls(k) - Lu(k) 

Also, the voice detection threshold value control 
means 6 sets the voice detection threshold value TH(k) 
according to the echo cancellation amount D(k) and out- 
puts to the voice detection means 7. 
i.e., in the event that 

D(k) < y 2 

and 

Lu(k) > y 3 
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bdth hoild, the setting is 

TH{k) = 6 2, 



and in all other cases, the setting is s 

TH(k) = 6 3, 

wherein y 2, y 3, 6 2, and 6 3 are each constants, 
with 0 2 and 6 3 being in the relation of 9 2 < 6 3. 

Consequently, in the event that the echo cancella- 
tion amount D(k) is small, the voice detection threshold 
value TH(k) Is set to be high and provided to the voice 
detection means 7, and in the event that the echo can- 
cellation amount D(k) is great, the voice detection 
threshold value TH(k) Is set to be low and provided to 
the voice detection means 7. 

The reason that the voice detection threshold value 
control means 6 sets the voice detection threshold value 
TH(k) according to the echo cancellation amount D(k) is 
due to the following reason: i.e.. in the event that the 
echo cancellation amount D(k) is small, a good deal of 
residual echo component remains within the residual 
signal U(k), so that the residual signal level Lu{k) indi- 
cates the level of a superimposed signal of the voice of 
the near-end caller and of the residual echo component. 
Accordingly, the voice detection means 7 mis-identifies 
the residual echo component within the residual signal 
U{k) as the voice of the near-end caller, unless the voice 
detection threshold value TH(k) is set sufficiently great 

On the other hand, in the event that the echo can- 
cellation amount D(k) is great, the amount of residual 
echo component remaining within the residual signal 
U(k) is extremely small, so that the residual signal level 
Lu(k) accurately indicates the level of the voice of the 
near-end caller. By setting the voice detection threshold 
value TH(k) to a low level at this time, the voice detec- 
tion means 7 responds to the voice of the near-end 
caller within the residual signal U(k) in a sensitive man- 
ner, thus allowing detection of minute consonant pro- 
nunciations at the beginning of words. 

As can be understood from Fig. 6A and Fig. 6B 
describing the operation of the present errtoodiment, the 
voice detection threshold value TH(k) also changes in 
accordance with the change in the echo cancellation 
amount D(k) from the aforementioned input transmis- 
sion signal level Ijs(k) and residual signal level |ju(k) as 
shown In Fig. 6A so that even in the event that the echo 
cancellation amount D(k) is small, the residual signal 
level Lu(k) never exceeds the voice detection threshold 
value TH(k). Accordingly, as shown in Fig. 6B. the voice 
detection flag FLG remains set to 0. 

As described above, according to the present sec- 
ond embodiment, excellent detection of the voice of the 
near-end caller can be obtained with little error, regard- 
less of the magnitude of the echo cancellation amount. 

In other words, in the event that the echo cancella- 
tion amount D(l^ is small, i.e., in the event that there is 



residual echo component within the residual signal U(k). 
resulting in the residual signal level not accurately rep- 
resenting the level of the near-end caller alone, the 
voice detection threshold value TH(k) is set great as to 
the residual signal U(l^. On the other hand, in the event 
that the echo cancellation amount D(k) is great, i.e., in 
the event that there is practically no residual echo com- 
poheht'within the residual signal U(k). resulting in the 
residual signal level accurately representing the level of 
the near-end caller alone, the voice detection threshold 
value control means 6 sets the voice detection threshold 
value TH(k) small as to the residual signal U(k). Conse- 
quently, the voice detection means 7 is capable of per- 
forming detection of the voice of the near-end caller 
from the residual signal level, without being affected by 
the amount of echo cahcellatioh by the adaptive filter 4. 

Third Embodiment 

|vJe>ct, Fig. 7 is a constructional diagram illustrating 
the echo canceller provided to the echo processing unit 
25A of the third embodiment according to the present 
invention, illustrating an echo canceller which does not 
mis-identify the residual echo component within the. 
residual signal which is the output transmission signal, 
with the voice of the near-end caller. 

In Fig. 7. the portions identical to those of the echo 
canceller according to the second embodiment shown 
in Fig. 5 are denoted by the same reference numerals, 
and description thereof is omitted. In comparison with 
the echo canceller according to the second embodiment 
shown in Fig. 5, the echo canceller according to the 
third embodiment shown in Rg. 7 has the level calculat- 
ing means 1 for calculating the level of the reception sig- 
nal R(k) similar to that of the first embodiment shown in 
Fig. 3 Also, the voice detection threshold value control 
means 6 is arranged so as to control the voice detection 
threshold value according to a value wherein the echo 
cancellation amount calculated by the echo cancellation 
amount calculation means 5 is sufcrtracted from the level 
of the reception signal R(i^. 

i.e., in this third embodiment, the voice detection 
threshold value control means 6 sets the voice detection 
threshold value TH(k) represented in the following 
expression according to the value obtained by subtract- 
ing the echo cancellation amount D(k) from the level 
Lr(k) of the reception signal R(k), and provides this to 
the voice detecting means 7. 

TH{k) = U(k) - D(k) - p 2 

However, in order to provide a lower limit for TH(I^, 
in the event that 

U(k) - D(l^ ^ p 2 + 6 1 

holds, the setting is 
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TH(k) = e 1, 

Here, p 2 and e 1 are constants, with e 1 being a 
constant for determining the lower limit for TH(k). Con- 
sequently, in the event that the value obtained by sub- 
tracting the echo cancellation amount D(k) from the 
reception signal level Lr(k) is great, the voice detection 
threshold value TH(k) is set so as to be great, and this 
value is provided to the voice detecting means 7. On the 
other hand, in the event that the value obtained by sub- 
tracting the echo cancellation amount D(k) from the 
reception signal level Lr(k) is small, the voice detection 
threshold value TH(k) is set so as to be small, and this 
value is provided to the voice detecting means 7. 

The reason that the voice detection threshold value 
control means 6 sets the voice detection threshold value 
TH{k) according to the value obtained by subtracting the 
echo cancellation amount D(k) from the reception signal 
level Lr(k) is due to the following reason: i.e.. in the 
event that the value obtained by subtracting the echo 
cancellation amount D{k) from the reception signal level 
Lr(k) is great, a good deal of residual echo component 
remains within the residual signal Ll(k), so that the resid- 
ual signal level Lu(k) indicates the level of a superim- 
posed signal of the voice of the near-end caller and of 
the residual echo component. Accordingly, the voice 
detection means 7 mis-identifies the residual echo com- 
ponent within the residual signal U(k) as the voice of the 
near-end caller, unless the voice detection threshold 
value TH(k) is set sufficiently great. 

On the other hand, in the event that the value 
obtained by subtracting the echo cancellation amount 
D(k) from the reception signal level Lr{k) is small, the 
amount of residual echo component remaining within 
the residual signal U(k) is extremely small, so that the 
residual signal level Lu(k) accurately indicates the level 
of the voice of the near-end caller. By setting the voice 
detection threshold value TH(k) to a low level at this 
time, the voice detection means 7 responds to the voice 
of the near-end caller within the residual signal U(k) in a 
sensitive manner, thus allowing detection of minute 
vowel pronunciations at the beginning of words. 

As can be understood from Fig. 8 A through Fig. 8C 
describing the operation of the present third embodi- 
ment, the voice detection threshold value TH(k) also 
changes in accordance with the change in the value of 
the echo cancellation amount D(k) from the aforemen- 
tioned input transmission signal level Ls(k) and residual 
signal level l_u(k) subtracted from the reception signal 
level Lr{l^ as shown in Fig. 8B as opposed to the recep- 
tion signal level Lr(k) shown in Fig .8A, so that even in 
the event that the value obtained by subtracting the 
echo cancellation amount D(k) from the reception signal 
level Lr(k) is great, the residual signal level Lju(k) never 
exceeds the voice detection threshold value TH(k), 
Accordingly, as shown in Fig. 8C, the voice detection 
flag FLG remains set to 0. 

As described above, according to this third embod- 



iment, excellent detection of the voice of the near-end 
caller can be obtained with little error, regardless of the 
magnitude of the reception signal level and of the echo 
cancellation amount. 

5 In other words, in the event that the value obtained 
l>y subtracting the echo cancellation amount D(k) from 
the reception signal level Lr(k) is great, the voice detec- 
tion threshold value control means 6 sets the voice 
detection threshold value TH(k) as to the residual signal 

10 U(k) so as to be great On the other hand, in the event 
that the value obtained by subtracting the echo cancel- 
lation amount D{k) from the reception signal level Lr(k) 
is small, the voice detection threshold value TH(k) as to 
the residual signal U(l^ is set so as to be small. Gonse- 

75 quently, the voice detection means 7 is capable of per- 
forming detection of the voice of the near-errd caller 
from the residual signal level, without being affected by 
the reception signal level Lr(k) and by the amount of 
echo cancellation by the adaptive filter 4. 

20 

Fourth Embodiment 

Next, Fig. 9 is a constructional diagram illustrating 
the echo suppresser provided to the echo processing 

25 unit 25A of the fourth embodiment according to the 
present invention, illustrating an echo suppresser which 
does not mis-Identify the change in the level of transmis- 
sion signals due to operation of the attenuator with the 
voice of the near-end caller. 

30 In Fig. 9. the level calculating means 1 and the 
voice detection threshold value control means 6 are the 
same as those of the first embodiment shown in Fig. 3, 
and the level calculating means 3 is the same as that of 
the second embodiment shown in Fig. 5. Hence, 

35 description thereof is omitted. New reference numerals 
are as follows: Reference numerals 9 and 10 denote the 
attenuator control means and attenuator, wherein the 
attenuator control means 9 controls the suppression 
amount AT(k) of the attenuator 10 based on the recep- 

40 tion signal level Lr(k). Thus, the attenuator 10 is 
arranged so as to suppress the level of the input trans- 
mission signals S(l^ according to the suppression 
amount AT(k) controlled by the attenuator control 
means 9, and output suppression transmission signals 

45 W(k). In the event that the input transmission signal 
level Ls(k) exceeds the voice detection threshold value 
TH(k). the voice detection means 7 sets the voice detec- 
tion flag FLG to 1 and outputs, and in the event that the 
input transmission signal level Ls(k) does not exceed 

so the voice detection threshold value TH(k). the voice 
detection means 7 sets the voice detection flag FLG to 
0 and outputs. 

Now, the aforementioned attenuator control means 
9 determines the suppression amount as described 

55 below: 

(Process 1 ) In the event that Lr(k) s a , the suppres- 
sion amount AT(k) is set to 0. 
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{Process"2) In the event that Lr M >"a ;the sup-- 
pressioii amount AT(k) is set to AL. 

Here, the constant a denotes the threshold value' 
for determining the presence of voice for the reception 
signal which is the voice of the far-end caller, AL 
denotes the suppression arnourrt of the ^ttenuator, 
Process i is a proc«jure unde7tel^nTri ffie e^^^ 
has been judged that the far-end caller is not speaking, 
and Process 2 is a procedure undertaken in the event 
that it has been judged that the far-end caller is speak- 
ing. 

The attenuator 10 suppresses the input transmis- 
sion signal S(k) in accordance with the suppression 
amount AT(k), and outputs the suppression transmis- 
sion signal W(k). Here, the suppression transmission 
signal W(k) is equivalent to the output transmission sig- 
nal S2(k) shown in Fig. 1 and Fig. 2. 

The point of greatest importance in the present 
embodiment is the operation of the voice detection 
threshold value control means 6 and the voice detection 
means 7, wherein near-end caller voice detection is not 
performed from the output transmission signal level as 
with known exannples, i.e.. from the suppression trans- 
mission signal level Lw(k). Rather, a voice detection 
threshold value TH(k) is set based on the reception sig- 
nal level Lr(k). and near-end caller voice detection is 
performed using the input transmission signal level 
Ls(k) and the voice detection threshold value TH(k). 

As can be understood from Fig. 10A through Fig. 
10C describing the operation of the present fourth 
embodiment, the suppression amount AT(f^ Is control- 
led based on the reception signal level Lr(k) shown in 
Fig. 10A, and as shown in Fig. 10B. the input transmis- 
sion signal level Ls(k) is suppressed according to the 
suppression amount AT(k) and suppression transmis- 
sion signals of the suppression transmission signal level 
Lw(k) are output. Consequently, there is no mis-identify- 
ing the change in the suppression transmission signal 
level Lw(k) due to operation of the attenuator with the 
voice of the near-er)d caller. Accordingly, as shown in 
Fig. IOC, the voice detection flag FLG remains set to 0. 

As described above, with the echo suppresser 
relating to the present fourth embodiment, the voice 
detection flag FLG is output externally from the echo 
suppresser, so in comparison with known loudspeaker 
telephones which use the output signal of the echo sup- 
presser as the voice detection means input, a loud- 
speaker telephone can be constructed wherein the 
effects of change in the reception signal level does not 
easily cause mas-identifying of the voice of the near-end 
caller. 

Also, since the voice detection means 7 determines 
the presence of voice of the near-end caller based on 
the input transmission signal level Ls(k) and the voice 
detection threshold value TH(k), determination of the 
voice detection flag FLG is not affected by the suppres- 
sion processing of the attenuator 1 0 regarding the input 



transmission signal S(k). i.e., as describe above, even if 
^ the attenuator 10 changes the level of the transmission 
signal based on the relation of the reception signal level 
Lr(k) and the input transmission signal level Ls(k), the 

5 change is not mis-identified as starting or ending of 
speech by the near-end caller, thus enabling proper out- 
put of the voice detection flag FLG. 

Furthiar. in bftier words, tife voioe detec^^ 

put externally from the echo suppresser is obtained by 

10 inputting the input transmission signal level L5(k) and 
the voice detection threshold value TH(k) to the voice 
detection means 7, even in the event that the suppres- 
sion amount AT(k) of the transmission signals is 
changed by the attenuator 10 due to speech by the far- 

15 end caller, the voice detection flag FLG is not effected at 
all by that change, i.e., a loudspeaker telephone can be 
constructed wherein mis-identification of the near-end 
caller voice does not easily occur due to the presence or 
absence of reception signals, as compared to known 

20 systems wherein output transmission signals from the 
echo suppresser with suppression performed thereto 
are input to the voice detection means. 

Fifth Embodiment 

25 

Next. Fig. 1 1 is a constructional diagram illistrating 
the echo canceller provided to the echo processing unit 
25A of the fifth embodiment according to the present 
invention, illustrating an echo canceller which does not 

30 mis-identify the residual echo component within the 
residual signal which is the output transmission signal 
with the voice of the near-end caller, nor mis-identify the 
change in the level of transmission signals due to oper- 
ation of the attenuator with the voice of the near-end 

35 caller. 

In Fig. 1 1 , the level calculating means 1 through 3, 
the adaptive filter 4, the echo cancellation amount cal- 
culation means 5. the voice detection threshold value 
control means 6, and the voice detection means 7 are 

40 the same as those of the third embodiment shown in 
Fig. 7, and description thereof Is omitted. Additions to 
the third embodiment shown in Rg. 7 are the attenuator 
control means 9 and attenuator 10 the same as those of 
the fourth embodiment shown in Fig. 9. 

45 Now. the attenuator control means 9 determines 
the suppression amount AT(k) of the attenuator 10 
based on the reception signal level Lr(k) as described 
below: 

50 (Process 1) In the event that Lr (k) £ a , the sup- 
pression amount AT(k) is set to 0. 
(Process 2) In the event that Lr (k) > a . the sup- 
pression amount AT(k) is set to AS (< AL). 

55 Here, the constant a denotes the threshold value for 
determining the presence of voice for the reception sig- 
nal which is the voice of the far-end caller. AS denotes 
the suppression amount of the attenuator 10. Process 1. 
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is a procedure undertaken in the event that it has been 
judged that the far-end caller is not speaking, and Proc- 
ess 2 is a procedure undertaken in the event that it has 
been judged that the far-end caller is speaking. 

Thus, the attenuator 10 is arranged so as to sup- 
press the residual signals U(k) from the adaptive filter 4 
according to the suppression amount AT{k) determined 
as described above, and output suppression transmis- 
sion signals W(k). 

Next, the operation of the present fifth embodiment 
will be made with reference to Fig. 12A through Fig. 
12C, As can be understood from Fig. 12A through Fig. 
12C, the voice detection threshold value TH(k) changes 
in accordance with the change in the value of the echo 
cancellation amount D(k) subtracted from the reception 
signal level Lr(k) as shown in Fig. 12B as opposed to the 
reception signal level Lr(k) shown in Rg .12A, in the 
same manner as that shown in Fig. 8B illustrating the 
third embodiment Thus, even in the event that the value 
obtained by subtracting the echo cancellation amount 
D(k) from the reception signal level Lr(k) is great, the 
residual signal level Lu(k) never exceeds the voice 
detection threshold value TH(k). Accordingly, as shown 
in Fig. 12C. the voice detection flag FLG remains set to 
0. 

i.e.. since the system performs detection of the 
voice of the near-end caller based on the residual signal 
level Lu(k) which is the input off the attenuator 10, 
instead of performing detection of the voice of the near- 
end caller based on the suppression transmission sig- 
nal level Lw(k) which is the output of the attenuator 10, 
the change in the suppression transmission signal level 
Lw(k) due to the attenuator 10 is not mis-identified as 
the voice of the near-end caller, as can be understood 
from Fig. 12A through Rg. 12C. 

As described above, according to this fifth embodi- 
ment, excellent near-end caller voice detection can be 
performed regardless of the magnitude of the reception 
signal level and of the echo cancellation amount. Also, 
the suppression amount of the attenuator can be 
changed without effecting at all the voice detection flag 
which is output externally from the echo canceller. 

In other words, since the voice detection flag FLG 
output externally from the echo canceller is obtained by 
irputting the residual signal level Lu(k) and the voice 
detection threshold value TH(k) to the voice detection 
means 7. even in the event that the suppression amount 
AT(k) of the residual signals is changed by the attenua- 
tor 10 due to speech by the far-end caller, the voice 
detection flag FLG is not effected at ail by that change, 
i.e.. a loudspeaker telephone can be constructed 
wherein mis-Identification of the near-end caller voice 
does not easily occur due to the presence or absence of 
reception signals, as compared, to known systems 
wherein output signals from the echo canceller with 
suppression performed thereto are input to the voice 
detection means. 



Sixth Embodiment 

Next, Fig. 13 is a constructional diagram illustrating 
the echo canceller within the echo processing unit 25A 

5 in a sixth embodiment according to the present inven- 
tion. This echo canceller is arranged so as to be even 
more sure in the following points: that the built-in noise 
canceller only performs estimation of background noise 
properties during sections wherein the near-end caller 

10 is not speaking; and control of the noise cancellation 
amount is performed according whether or not there is 
speech by the near-end caller. 

As compared with the fifth embodiment shown in 
Fig. 11, a new construction is introduced here in the 

15 sixth ennbodlment. wherein a noise cemceller 1 1 is fur- 
ther provided between the adaptive filter 4 and the 
attenuator 10. to output to the aforementioned attenua- 
tor 10 noise cancellation transmission signals V(k) com- 
prised of the residual signals Ll(k) with the noise thereof 

20 canceled, according to the voice detection flag, estima- 
tion of the background noise properties having been 
made when the voice detection flag FLG output from the 
voice detection means 7 indicates that the near-end 
caller is not speaking. Then, the attenuator 10 is 

25 arranged so as to suppress the aforementioned noise 
cancellation transmission signals V(k) according to the 
suppression amount AT(k) set by the aforementioned 
attenuator control means 9 and output suppression 
transmission signals W(k). 

30 i.e., the noise canceller 11 is such that takes of 

the residual signals U(k) output from the adaptive filter 4 
and the voice detection flag FLG output from the voice 
detection means 7 as input, removes the near-end 
background noise contained in the residual signal U(k), 

35 and outputs noise cancellation transmission signals 
V(k). 

The noise canceller 1 1 performs estimation of the 
background noise properties when the voice detection 
flag FLG output from the voice detectton means 7 indi- 

40 cates that the near-end caller is not speaking, and con- 
trols the noise cancellation amount according to the 
voice detection flag FLG. Then, the attenuator 10 sup- 
presses the noise cancellation transmission signals 
V(k)according to the suppression amount AT(k) deter- 

45 mined by the attenuator control means 9, and outputs 
suppression transmission signals W(k). 

As described above, according to this sixth embod- 
iment, as with the above-described fifth embodiment, 
the suppression amount of the attenuator 10 can be 

50 changed without effecting the voice detection flag FLG 
which is output externally from the echo canceller, so 
the noise canceller 1 1 which uses this voice detection 
flag as the input thereof is not affected by the operation 
of the attenuator 10 at all. and is capable of performing 

55 estimation of the background noise properties when the 
near-end caller is not speaking, and controls the noise 
cancellation amount depending on whether the near- 
end caller is speaking. Accordingly, it becomes possible 
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to realize noise cancellation processing which does not : 
give the lar-end caller an unnatural impression. 

In other words, the noise canceller 1 1 is arranged to 
refer to the voice" detection flag" FLQ- which" is-not" 
affected by the operation of the attenuator 10, and thus 
is capable of performing estimation of the background 
noise properties when the near-end caller is not speak- 
ing and ithe fnpU are ohiy bad^- 
grouiid noise, arid controlling the noise cancellation 
amount without effects of the operation of the attenuator 
10, according to the input signals are voice signals or 
not. Accordingly, it becomes possible to realize noise 
cancellation processing which does not give the tar-end 
caller an unnatural Impression. 

As described above, according to the present 
invention, a hands-free telephone comprises: a hands- 
free microphone which converts the voice of a near-erKt 
caller into electrical signals and outputs as input trans- 
mission signals: a loudspeaker which is provided near 
the above microphone and outputs reception signals 
which are the voice of the far-end caller into the air as 
audible sound waves; an echo processing unit which 
outputs as an output transmission signal a residual sig- 
nal comprised of the input transmission signal of the 
near-end caller input through the aforementioned micro- 
phone, the echo component thereof having been 
removed, and also which outputs the reception signals 
from the far-end caller to the aforementioned loud- 
speaker, and further which outputs a voice detection 
flag based on connparison between a voice detection 
threshold value which changes according to change in 
level of the aforementioned reception signal and the 
level of the aforementioned residual signal; and trans- 
mission control means for performing transmission con- 
trol of the aforementioned output transmission signals 
only when the aforementioned voice detection flag indi- 
cates presence of voice, the aforementioned transmis- 
sion control being performed by receiving power supply 
only when the aforementioned voice detection flag indi- 
cates presence of voice. Thus, a hands-free telephone 
can be provided wherein near-end caller voice detection 
information is provided with little error, and wherein 
electric power saving effects are improved, and also 
wherein the far-end caller does not receive an unnatural 
impression. 

Also, the aforementioned echo processing unit 
comprises an echo canceller, comprising: an adaptive 
filter for taking input of the aforementioned reception 
signal and the aforementioned input transmission signal 
and outputttng a residual signal comprised of the afore- 
mentioned input transmission signal with the echo conv 
ponent thereof removed; first level calculating means for 
calculating the level of the aforementioned residual sig- 
nal; voice detection threshold value control means for 
controlling the aforementioned voice detection thresh- 
old value; and voice detection means for outputting a 
voice detection flag which indicates the presence of 
voice when the level of the aforementioned residual sig- 



nal exceeds the voice detection threshold value output 
from the aforementioned voice detection threshold 
value control means. Thus, by outputting the voice 
— detectionflag externally from the echo canceller, a loud- 
5 speaker telephone can be constructed wherein the 
effects of the reception signal level and the amount of 
echo canceling performed by the adaptive filter do not 
easily cause mis-idontifying of the voice of the near-end 
caller voice. 

10 Further, the aforementioned echo canceller further 
comprises second level calculating means for calculat- 
ing the level of the aforementioned reception signal, and 
also the aforementioned voice detection threshold value 
control means controls the aforementioned voice detec- 
75 tion threshold value according to the change in level of 
the reception signal output from the aforementioned 
second level calculating means. Tnus, in the event that 
a slight echo component remains in the residual signal 
even after echo cancellation is performed to the input 
20 transmission signal by the adaptive filter, owing to the 
reception signal level being great, so that the resrdual 
signal level does not accurately represent the level of 
the near-end caller alone, the voice detection threshold 
value is set so as to be great as to the residual signal. 
25 On the other hand, if the reception signal level is small, 
there is practically no echo component remaining in the 
residual signal after echo cancellation is performed to 
the input transmission signal by the adaptive filter. 
Hence, In the event that the residual signal level accu- 
se rately represents the level of the near-end caller alone, 
the voice detection threshold value is set so as to be 
small as to the residual signal. Consequently, the voice 
detection means is capable of performing detection of 
the voice of the near-end caller from the residual signal 
35 level, without being affected by the reception signal 
level. 

Further yet, the aforementioned echo canceller fur- 
ther comprises third level calculating means for calculat- 
ing the level of the aforementioned input transmission 
40 signal and echo cancellation amount calculating means 
for calculating the amount of echo cancellation based 
on the level of the aforementioned input transmission 
signal and the level of the aforementioned residual sig- 
nal, and also the aforementioned voice detection 
45 threshold value control means controls the aforemen- 
tioned voice detection threshold value according to the 
change in level of echo cancellation amount output from 
the aforementioned echo cancellation amount calculat- 
ing means. Thus, in the event that the echo cancellation 
so amount is small, i.e., in the event that there is residual 
echo component within the residual signal, resulting in 
the residual signal level not accurately representing the 
level of the near-end caller alone, the voice detection 
threshold value is set great as to the residual signal. On 
55 the other hand, in the event that the echo cancellation 
amount is great, i.e., in the event that there is practically 
no residual echo component within the residual signal, 
resulting in the residual signal level accurately repre- 
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senting the la/el of the near-end caller alone, the voice 
detection threshold value is set small as to the residual 
signal. Consequently, the voice detection means is 
capable of performing detection of the voice of the near- 
end caller from the residual signal level, without being 
affected by the amount of echo cancellation by the 
adaptive filter. 

Furthermore, the aforementioned echo canceller 
further comprises second level calculating means for 
calculating the level of the aforementioned reception 
signal, and also the aforementioned voice detection 
threshold value control means controls the aforemen- 
tioned voice detection threshold value according to a 
value of the reception signal level from which the echo 
cancellation amount has been subtracted. Thus, in the 
event that the value obtained by subtracting the echo 
cancellation amount from the reception signal level is 
great the voice detection threshold value is set so as to 
be great to the residual signal, and in the event that the 
value obtained by subtracting the echo cancellation 
amount from the reception signal level is small, the 
voice detection threshold value is set so as to be small 
to the residual signal. Consequently, the voice detection 
means is capable of performing detection of the voice of 
the near-end caller from the residual signal level, with- 
out being affected by the reception signal level and by 
the amount of echo cancellation by the adaptive filter. 

Moreover, the aforementioned echo processing unit 
comprises an echo suppresser, comprising: second 
level calculating means for calculating the level of the 
aforementioned reception signal; voice detection 
threshold value control means for controlling the afore- 
mentioned voice detection threshold value according to 
change in the level of the aforementioned reception sig- 
nal: third level calculating means for calculating the level 
of the aforementioned input transmission signal; voice 
detection means for outputting a voice detection flag 
which indicates the presence of voice when the level of 
the aforementioned input transmission signal exceeds 
the voice detection threshold value output from the 
aforementioned voice detection threshold value control 
means; an attenuator for suppressing the aforemen- 
tioned input transmission signal according to a certain 
suppression amount and outputting suppression trans- 
mission signals; and attenuator control means for con- 
trolling the suppression amount set for the 
aforementioned attenuator based on the aforemen- 
tioned reception signal level. Thus, since the voice 
detection flag is output externally from the echo sup- 
presser. in comparison with known loudspeaker tele- 
phones which use the output signal of the echo 
suppresser as the voice detection means input, a loud- 
speaker telephone can be constructed wherein the 
effects of change in the reception signal level do not 
easily cause mis-identifying of the voice of the near-end 
caller. 

Further, the voice detection flag output externally 
from the echo suppresser is obtained by inputting the 



input transmission signal level and the voice detection 
threshold value to the voice detection means, so that 
even in the event that the suppression amount of the 
transmission signal is changed by the attenuator due to 

5 speech by the far-end caller, the voice detection flag is 
not effected at all by that change, i.e.. a loudspeaker tel- 
ephone can be constructed wherein mis-identification of 
the near-end caller voice does not easily occur due to 
the presence or absence of reception signal, as com- 

10 pared to known systems wherein output signals from 
the echo suppresser with suppression performed 
thereto are input to the voice detection means. 

Moreover yet, the aforementioned echo canceller 
further comprises an attenuator for suppressing the 

15 aforementioned residual signals according to a certain 
suppression amount and outputting suppression trans- 
mission signals, and attenuator control means for con- 
trolling the suppression amount set for the 
aforementioned attenuator based on the aforemen- 

20 tioned reception signal level. Thus, the attenuator can 
suppress the residual signals according to the suppres- 
sion amount determined according to the reception sig- 
nal level, and output suppression transmission signals. 
Also, since the voice detection flag output externally 

25 from the echo canceller is obtained by inputting the 
residua! signal level and the voice detection threshold 
value to the voice detection means, even in the event 
that the suppression amount of the residual signals is 
changed by the attenuator due to speech by the far-end 

30 caller, the voice detection flag is not effected at all by 
that change. I.e., a loudspeaker telephone can be con- 
structed wherein mis-identification of the near-end 
caller voice does not easily occur due to the presence or 
absence of reception signal, as compared to known 

55 systems wherein output signals from the echo canceller 
with suppression performed thereto are input to the 
voice detection means. 

Further yet, the aforementioned echo canceller fur- 
ther comprises a noise canceller provided between the 

40 aforementioned adaptive filter and the aforementioned 
attenuator, the noise canceller performing estimation of 
l3ackground noise properties at times when the voice 
detection flag output from the aforementioned voice 
detection means indicates that the near-end caller is not 

45 speaking, controlling the noise cancellation amount 
according to the aforementioned voice detection flag 
and outputting to the aforementioned attenuator a noise 
cancellation transmission signal connprised of the afore- 
mentioned residual signal from which noise has been 

50 removed, and also wherein the aforementioned attenu- 
ator suppresses the aforementioned noise cancellation 
transmission signal according to a suppression amount 
set by the aforementioned attenuator control means and 
outputs suppression transmission signals. Thus, the 

55 noise canceller is arranged to refer to the voice detec- 
tion flag FLG which is not affected by the operation of 
the attenuator, and hence is capable of performing esti- 
mation of the background noise properties when the 
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near-end caller is riot speaking and the input transmis- 
sion signals are only background noise, and controlling 
the noise cancellation amount without effects of the 
operation of the attenuator, according to the input sig--" -^ 
nals are voice signals or not. Accordingly, it becomes $ 
possible to realize noise cancellation processing which 
does not give, the far-end caller an unnatural impres- 
sibnT" 

Claims 

1 . A hands-free telephone, comprising: 

a hands-free microphone which converts the 
voice of a near-end caller into electrical signals is 
and outputs as input transmission signals; 
a budspeaker which is provided near the 
above microphone and outputs reception sig- 
nals which are the voice of the far-end caller 
into the air as audible sound waves; 20 
an echo processing unit which ou4>uts as an 
output transmission signal a residual signal 
comprised of the input transmission signal of 
the near-end caller input through said micro- 
phone, the echo component thereof having 25 
been removed, and also which outputs the 
reception signals from the far-end caller to said 
loudspeaker, and further which outputs a voice 
detection flag based on comjDarison between a 
voice detection threshold value which changes 30 
according to change in level of said reception 
signal and the level of said residual signal; and 
transmission control means for performing 
transmission control of said output transmis- 
sion signals only when said voice detection flag 35 
indicates presence of voice, said transmission 
control being performed by receiving power 
supply only when said voice detection flag indi- 
cates presence of voice. 

40 

2. The hands-free telephone according to claim 1, 
wherein said echo processing unit conrprises an 
echo canceller, comprising: 

an adaptive filter for taking input of said recep- 45 

tion signal and said input transmission signal 

and outputting a residual signal comprised of 

said input transmission signal with the echo 

component thereof removed; 

first level calculating means for calculating the so 

level of said residual signal; 

voice detection threshold value control means 

for controlling said voice detection threshold 

value; and 

voice detection means for outputting a voice ss 
detection flag which indicates the presence of 
voice when the level of said residual signal 
exceeds the voice detection threshold value 



output from said voice detection threshold 
value control means. 

3. The hands-free telephone according to daim 2. 
wherein said echo canceller further comprises sec- 
ond level calculating means for calculating the level 
of said reception signal, and also wherein said 

' voice "detecSbri thres^^ control means con- 

trols said voice detection threshold value according 
to the change in level of the reception signal output 
from said second level calculating means. 

4. The hands-free telephone according to daim 2. 
wherein said echo canceller further comprises third 
level calculating means for calculating the teyej of 
said input transmission signal and echo cancella- 
tion amount calculating means for calculating the 
amount of echo cancellation based on the level of 
said irput transmission signal and the level of said 
residual signal, and also wherein said voice detec* 
tion threshold value control means controls said 
voice detection threshold value according to the 
change in level of echo cancellation amount output 
from said echo cancellation amount calculating 
means. 

5. The hands-free telephone according to daim 4, 
wherein said echo canceller further comprises sec- 
ond level calculating means for calculating the level 
of said reception signal, and also wherein said 
voice detection threshold value control means con- 
trols said voice detection threshold value according 
to a value of the reception signal level from which 
the echo cancellation amount has been subtracted. 

6. The hands-free telephone according to daim 1. 
wherein said echo processing unit comprises an 
echo suppresser. comprising: 

second level calculating means for calculating 
the level of said reception signal; 
voice detection threshold value control means 
for controlling said voice detection threshold 
value according to change in the level of said 
reception signal; 

third level calculating means for calculating the 
level of said input transmission signal; 
voice detection means for outputting a voice 
detection flag which indicates the presence of 
voice v^en the level of said input transmission 
signal exceeds the voice detection threshold 
value output from said voice detection thresh- 
old value control means; 
an attenuator for suppressing said, input trans- 
mission signal according to a certain suppres- 
sion amount and outputting suppression 
transmission signals; and 
attenuator control means for corrtrolling the 
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suppression amount set for said attenuator 
based on said recepton signal level. 

7. The hands-free telephone according to claim 5, 
wherein said echo canceller further comprises an 5 
attenuator for suppressing said residual signal 
according to a certain suppression amount and oul- 
putting suppression transmission signals, and 
attenuator control means for contrdling the sup- 
pression amount set for said attenuator based on io 
said reception signal level. 

8. The hands-free telephone according to claim 7. 
wherein said echo canceller further comprises a 
noise canceller provided between said adaptive fil- is 
ter and said attenuator, said noise canceller per- 
forming estimation of background noise properties 

at times when the voice detection flag output from 
said voice detection means indicates that the near- 
end caller is not speaking, controlling the noise can- 20 
cellation amount according to said voice detection 
flag and outputting to said attenuator a noise can- 
cellation transmission signal comprised of said 
residual signal from which noise has been 
removed, and also wherein said attenuator sup- 25 
presses said noise suppression transmissions sig- 
nal according to a suppression amount set by said 
. attenuator control means and outputs suppression 
transmission signals. 

30 
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the voice detection threshold value acconding to the re- 
ception signal level and the echo cancellation amount; 
voice detection means 7 for outputting a voice detection 
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signal level, and attenuator 10 for performing suppres- 
sion according to the suppression amount. 
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